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ANXIETY AS RELATED TO INCONGRUENCIES 
BETWEEN VALUES AND FEELINGS’ 
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Veterans Administration, Palo Alto 
and 
Stanford University 


Probably the most crucial problem in the understanding of the functional 
psychopathologies is the problem of anxiety. But despite the fact that anxiety is 
generally considered to be a significant factor in both normal and abnormal be- 
haviors, our general understanding of its causes and correlates remains unsatis- 
factory. In a recent approach to this problem the author (8) proposed that 
anxiety may fruitfully be conceptualized as being a function of the quantity of 
unassimilated perceptual material. The purposes of the present paper are to 
summarize this position, to report briefly on some preliminary research to which 
ithas led, and to describe a new technique being developed in connection with 
this research. 


RATIONALE 


In the course of living an individual undergoes many kinds of experiences, 
some anticipated and some unexpected, some sought and some due to the 
caprice of circumstance. It appears that such experiences, as realized phenom- 
enally by the individual—as perceived by him, do not typically remain unorgan- 
ized and in isolation, but rather tend to be incorporated into his overall apper- 
ceptive mass, i.e., into existent organized orderings of experience, which may 
be referred to as perceptual systems. The word assimilation can be used to refer 
to the process whereby an individual’s percepts enter harmoniously and con- 
gruously into relation with other percepts, and are incorporated in a concordant 
manner into perceptual systems. 


But on occasion an individual’s percepts are not readily assimilable into 
previously organized experiencings, i.e., they do not “fit” and may actually be 
incongruent with previously integrated material. It is postulated that—until and 
unless appropriate perceptual restructuring occurs—percepts which are not as- 
similated will tend to accumulate, such that it is meaningful to speak of the 
quantity of unassimilated? perceptual data. It is hypothesized that anxiety is a 
positive function of the magnitude of unassimilated percepts, i.e., that the greater 
the quantity of unassimilated percepts the greater the degree of anxiety.® 


This hypothesis could be tested most directly by testing the correlation be- 
tween amount of unassimilated material and level of anxiety. Since there ap- 
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“Assimilation can ibly best be conceived of as occurring in various degrees, but for simplicity of 
Presentation it is pane that percepts either are, or are - assimilated. saad 


‘Probably a better formulation, but one which was not explored in the studies here reported, would 
we that onxiety is a function of the ratio of unassimil material to the sum of assimilated plus 


pears to be no direct way of measuring the amount of unassimilated material, 
however, it is necessary to utilize some indirect means of evaluating this quantity, 


Probably the most important reason for percepts being unassimilable is the 
existence of incongruencies between them, or between the percepts involved 
and existent perceptual systems into which they would be assimilated.‘ By in. 
congruency is meant characteristics of relationships between percepts or percep. 
tual systems which make them difficult to reconcile or to harmonize, such as cen- 
tradictions, inconsistencies, and anomalies. Two examples of incongruent mat. 
erial may help to make this point clear. If one has feelings of love and affec. 
tion toward a given person, and yet this person behaves in a manner which 
arouses feelings of hostility, these new feelings might be somewhat difficult to 
assimilate. If one has high moral standards, and yet perceives that he has trans- 
gressed against these standards, these percepts might likewise be difficult to 
assimilate.® 


Because of the implied relationship between incongruence and unassini- 
lated material, it seemed reasonable to attempt to estimate the quantity of un- 
assimilated percepts indirectly—by evaluating the degree of incongruency in the 
personality. This attempt would be based on the supposition that if an individual 
has marked incongruencies in the way he perceives different things and events 
and persons in life, then in the normal course of his behaviors he is likely to en- 
counter a number of experiences which he cannot assimilate, and so will develop 
a large amount of unassimilated material. Hence, the above hypothesis can be 
restated to read: that anxiety is a positive function of incongruencies among 
one’s perceptions and the perceptual systems into which they would be assimi- 
lated. It is to be noted that it is not held that anxiety is a function of personality 
incongruency per se; rather, the extent of incongruency is to be used to estimate 
the level of unassimilated material. Nor is it maintained that all incongruencies 
among perceptual systems necessarily lead to unassimilated percepts; on the 
contrary, individuals presumably sometimes avoid percepts in certain experien- 
tial areas in which there is marked incongruency, and so do not build up a back- 
log of unassimilated material despite the incongruency. Accordingly, it can be 
held only that a measure of incongruency should furnish a very rough measure 
of the level of unassimilated material, and hence of anxiety. 


It is, of course, not possible—or in any event not feasible—to attempt to 
measure personality incongruency generally. One can hope only to measure 
certain kinds of incongruency. It was decided that a practical, first-attempt goal 
would be to measure incongruencies between values and feelings. The specific 
purpose of the studies to be described, then, was to test the hypothesis that 
degree of anxiety is positively related to extent of incongruency between values 
and feelings. 


The relation between difficulty in assimilation, on the one hand, and incon- 
gruency between values and feelings, on the other, may now be discussed in 
more detail. Suppose that an individual has the experience of “bossing some- 
one,” and that he perceives this experience both with regard to feelings (whether 
he likes or dislikes it) and values (whether he considers it good or bad). We are 
interested in how he assimilates his feelings and his evaluations of this exper- 
4Other factors which might contribute to difficulty in assimilation are: an excessively high rate of input 
of new percepts, a very marked degree of novelty, or variety, or change implied by the precepts, anc 


——- with fears or uncertainties concerned with the future, such that they cannot be assimi- 
lated now. 

5The concept of incongruency appears to be related to Festinger’s (3) notion of “dissonance,’’ to Peak's 
(9) concept of “disparity,” and to concepts of ambivalence and mental conflicts generally—since the 
latter terms refer to various incongruencies within the personality. 
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ience. In order to illustrate the view being proposed, let us postulate four small 
perceptual systems, one each for the reception and incorporation of the percepts 
of Good, Bad, Like, and Dislike. So long as “bossing someone” is perceived 
simply as good, it can readily be assimilated into the ‘Good’ perceptual system; 
when perceived solely as bad it can readily be assimilated into that perceptual 
system; and so on for like and dislike. When something is perceived solely as 
good, bad, liked, or disliked, then, the situation as to assimilation would be 
rather simple. 

We can suppose that small perceptual systems tend to become organized 
into larger, higher-order systems, that systems with a single dimension tend to 
be organized into larger systems with several dimensions. Such development of 
perceptual systems would tend to reduce overall complexity and would represent 
the integration of separate perceptual systems into larger systems (8, p. 296). In 
the present instance, it is reasonable to suppose that systems of Good and Like 
would tend to be combined into a single system, and Bad and Dislike into an- 
other. This supposition is based on the fact that the former concepts imply posi- 
tive, approved characteristics, and the latter concepts negative and disapproved 
characteristics. More formally, it is assumed that individuals, in increasing the 
degree of assimilation of their percepts, tend to integrate separate, specific per- 
ceptual systems into higher-order systems. This assumption, as it relates to percep- 
tual systems, is merely an extension of a general assumption that persons tend to 
assimilate percepts, i.e., that just as percepts tend to be assimilated into percep- 
tual systems, so smaller perceptual systems tend to be integrated into larger ones. 


If “bossing someone” is perceived as both good and liked, then, it could 
readily be assimilated into the larger perceptual system ‘Good-Like’; if it is per- 
ceived as both bad and disliked, it could be assimilated into the larger percep- 
tual system ‘Bad-Dislike’; but if it is perceived as either good and disliked, or bad 
and liked, it could not be assimilated into either of the larger systems. In other 
words, difficulties in assimilation are likely to arise when one’s percepts are in- 
congruent with respect to the systems in terms of which they tend to be classified. 
The illustrative example just given, as well as the studies to be reported here, are 
based on the assumption that adults in our culture develop perceptual systems 
in which percepts of Good and Like, and Bad and Dislike, tend to be integrated 
into higher-order perceptual systems. 


EXPERIMENTAL APPLICATIONS 


Three studies directed to the hypothesis that anxiety is positively related to 
extent of incongruency between values and feelings will be described. These 
studies are not presented as being crucial experiments for the present theory. 
Probably their results could be rationalized in terms of conflict theory or of mod- 
ern phenomenology. However, the experiments were developed in terms of the 
present theory, and are directly pertinent to it. They are best described as pre- 
liminary attempts to introduce empirical quantification into, and to develop 
specialized techniques for the study of, the proposed assimilation theory of 
anxiety. 


Study 1 


Subjects. — Twenty male NP patients on an admissions ward at the Palo 
Alto VA Hospital served as Ss. Including various diagnoses, they were selected 
by the ward staff as being capable of dealing successfully with the tasks used 
in the study. 


Materials and Procedures. — In order to obtain estimates of how anxious 
the Ss appeared, standardized interviews were conducted—by a person not 
otherwise involved in the study. The interviewer rated the Ss’ apparent anxiety 
on an 8-point scale. In order to obtain indications of how ‘anxious’ the Ss felt, 
each S was asked to rate himself on three 8-point scales, with regard to (a) how 
anxious he felt “right now,” (b) how anxious he had felt “generally today,” and 
(c) how anxious he felt “now as compared with the least and most anxious” he 
had ever felt. The last of the self-rating scales, which can be referred to as the 
self-comparison scale, was utilized because it was believed that it would provide 
an S with more meaningful end-points than the other two self-rating scales. 


A special technique was devised in order to obtain an estimate of extent 
of incongruency between values and feelings. A box, 20 x 20 x 4 in., was con. 
structed of wood, and the top lettered as indicated in Figure 1. Each of the 16 
squares in the grid had a slot cut into it through which small cards could be 
dropped into the cubicles below. Fifty-six statements, such as “marriage,” “read- 
ing the daily paper,” “obeying my parents,” “being an adult,” “carrying o 
grudge,” “my father,” “myself,” “to fight or quarrel,” “relaxing,” and the like, 
were typed on small cards.® Forty-six of these statements were selected as refer. 
ring to the kinds of experiences, events, or relationships about which incongru- 
encies might be expected to exist, and 10 were selected as referring to more 
innocuous matters. 
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Fig. 1. Layout of box used to measure incongruency. 


The S was given a tray containing the 56 cards in a predetermined random 
order. His task was to read the statement on each card and then, first, to decide 
how he evaluated the statement on the card—whether he considered it Mainly 
Good, Mainly Bad, Both Good and Bad, or Neither Good nor Bad, and secondly 
to decide how he felt about the statement—whether he considered his feeling to 
be Mostly Like, Mostly Dislike, Both Like and Dislike, or Feel Indifferent. On the 
basis of these two decisions, which determined both the appropriate row and 
the appropriate column, the S was to drop the card through the indicated slot 
in the top of the box. 


®Mimeographed copies of the test items, rating scales, and directions used in all three studies reported 
here, as well as more recently devised dene hg are available on request from the author. 
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It was reasoned that incongruencies between values and feelings would be 
indicated by the slots chosen. Thus, the diagonal squares—those labeled 0 in 
Figure 1—were assumed to reflect congruency between values and feelings. The 
two squares labeled 5 were assumed to indicate greatest incongruency. Incon- 
gruency values assigned to each of the other squares are indicated in Figure 1. 
All these assignments of values were done in advance on a judgmental basis. 
The values indicated in Figure 1 of course did not appear on the box used for 
testing. The incongruency score for each S was the sum of the values indicated 
by the squares into which he placed cards, the possible score range thus being 
from 0 to 280. 


All Ss were seen individually. The interview was conducted first, then the S 
filled out the self-rating scales, and then the incongruency procedure was con- 
ducted. 


Results. — Pearsonian correlation coefficients between incongruency scores 
and anxiety estimates were as follows: between incongruency and interviewer 
rating, .10; between incongruency and self-rating “right now,” .14; between in- 
congruency and self-rating “today generally,” .41; and between incongruency 
and self-rating “comparison,” .44. All of the coefficients are in the predicted 
direction, and the last two are significant at the .05 level on a one-tailed test. 
These results, though not confirmatory, tend to support the experimental hypo- 
thesis. It is to be noted that the correlations are highest for those anxiety esti- 
mates based on how the Ss themselves reported feeling. Intercorrelations be- 
tween the different estimates of anxiety were: between interviewer and self- 
rating “right now,” .32; between interviewer and self-rating “today generally,” 
.21; between interviewer and self-rating “comparison,” .02; and for the various 
self-ratings: “right now” and “today generally,” .66; “right now” and “com- 
parison,” .24; and “today generally” and “comparison,” .62. 


Study 2 

This study was undertaken in an attempt to extend the findings of Study 1, 
and in particular to try to develop a group test form of the incongruency tech- 
nique. 


Subjects. — The test procedures were group administered to 70 undergrad- 
vate college students, including 34 men and 36 women. During the course of 
administration it became apparent from Ss’ questions that many of them did not 
clearly understand the instructions for one part of the incongruency test. The Ss 
were therefore asked to indicate, on their test forms, whether they felt they had 
clearly understood, or had not clearly understood the instructions. Forty-two Ss 
indicated that they did not clearly understand the directions, and accordingly 
their test forms were considered invalid. This left an effective sample of 28 Ss 
(20 men and 8 women). 


Methods and Procedures. — Level of anxiety was estimated by the Taylor 
Manifest Anxiety Scale (12) and two self-rating scales. The two scales were not the 
same as those used in Study 1; rather, they represented attempts to improve upon 
those scales. One of these scales concerned how much stress and inner distress 
Shad been “feeling recently,” and the other was a “comparison” scale in which 
S$ was asked to indicate his present level of inner distress in terms of the worst 
and best he had ever felt. The word “anxiety” was not used in these scales, and 
the attempt was made to construct them at a level of sophistication appropriate 
to college students. 
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In order to estimate extent of incongruency between values and feelings two 
forms were prepared. One was called Survey of Likes and Dislikes, and included 
100 items of the same type used in Study 1, each of which was to be marked o; 
Mostly Like, Mostly Dislike, Both Like and Dislike, or Feel Indifferent. The other 
form was called Evaluations Form, and included 98 items of the same type, each 
of which was to be checked as Mainly Good, Mainly Bad, Both Good and Bad, 
or Neither Good nor Bad. Eighty-one items were common to both forms, and 
the incongruency score for each S was based upon the degree of incongruency 
between his indicated evaluations and feelings about the content of these 8] 
items. The Ss were not told that the two forms would be combined in this manner, 
The method of scoring was the same as in Study 1, except that squares given 
zero credit on Figure 1 were now given 1 point. 


The tests were given in the following order: Survey of Likes and Dislikes, 
Taylor Anxiety Scale, Evaluations Form, and self-rating scales. 


Results. — For the 28 Ss who indicated that they understood the instructions, 
and who could be considered as having taken the tests in a valid manner, the 
following correlation coefficients were obtained: between incongruency and 
Taylor MAS, .45; between incongruency and self-rating “recently,” .47; and 
between incongruency and self-rating “comparison,” .46. All of these co- 
efficients are significant at the .01 level on a one-tailed test. Intercorrelations 
between the different estimates of anxiety were: between Taylor MAS and self- 
rating “recently,” .40; between Taylor MAS and self-rating “comparison,” .49; 
and between the two self-ratings, .49. 


The fact that so many Ss failed to comprehend the test instructions is indi- 
cative of the inadequacy of the instructions, and represents a problem which 
must be overcome before the incongruency technique can become a useful ex- 
perimental tool for group use. However, the data obtained upon the 28 Ss who 
reported that they understood the instructions clearly does not appear to have 
been biased in any way by the fact that other Ss did not understand clearly, and 
is thus acceptable os evidence with regard to the hypothesis being tested.” 


College Board Scholastic Aptitude Test scores were available for the 28 Ss, 
and it was determined that the correlation between verbal aptitude and incon- 
gruency was .21, and between mathematical aptitude and incongruency .23. 
Neither of these coefficients are significant at the .05 level, and thus they suggest 
that incongruency scores are not a function of intelligence. 


Study 3 


In this study the same test materials and procedures were used as in Study 
2, except that NP patients were used as Ss and were tested individually. The 
purpose of the study was not only to examine further the relation between in- 
congruency and anxiety, but also to test the hypothesis that the mean of incon- 
gruency scores would be higher for the NP patients than for the students. 


Subjects. — Forty-six male patients, mostly schizophrenics from continued- 
treatment wards, were seen for examination. Unlike the Ss in Study 1, these Ss 
were not selected in terms of their being probably capable of dealing adequately 
with the test materials. For various reasons, not all of the test measures were ob- 
tained on each S, so that N’s for the various comparisons varied slightly. 
7Intercorrelations between incongruency and anxiety for the 42 Ss who reported that they did not 
Copy enterstond the instructions were all insignificant, and were as follows: between incongruency 


and Taylor MAS, .18; between incongruency and self-rating “recently,” —.01; and between incon- 
gruency and self-rating “comparison,” .07. 
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Materials and Procedures. — As already noted, the same test materials were 
ued as in Study 2. The order of testing in most instances was: Evaluations 
form, Self-ratings, Survey of Likes and Dislikes, and Taylor MAS. 


Results. — Coefficients of correlation between incongruency and the anxiety 
estimates were as follows: between incongruency and Taylor MAS (N = 42), .17; 
between incongruency and self-rating “recently” (N = 45), .08; and between 
incongruency and self-rating “comparison” (N = 45), —.11. None of the co- 
efficients are supportive of the major hypothesis. Comparison of the students 
and patients on incongruency gave the following result. For 45 patients the 
mean incongruency score was 49.73, with SD — 19.94; for 28 students the mean 
was 41.18, with SD = 17.95. Comparison of the means yielded a ¢ of 1.82, 
which with 71 df is significant at the .05 level on a one-tailed test. While the 
two groups differed on incongruency, the difference was not great, and there 
was considerable overlap. 


With regard to the insignificant correlations between incongruency and 
anxiety, which are contrary to the findings in Studies 1 and 2, the implication is 
either that the major hypothesis is not valid, or that for some reason the measur- 
ing instruments did not furnish valid estimates in this study. With regard to the 
latter, it seems possible that the test instructions and the self-rating scales were 
worded at too sophisticated a level for the patients, even though all Ss reported 
that they understood clearly what they were to do. Another possibility is that 
some of the patients answered the Taylor MAS in a defensive and invalid man- 
ner. This possibility is rendered more reasonable by the following finding. MMPI 
records were available on 35 of the Ss, and since the Taylor MAS includes items 
from the MMPI it was reasoned that certain MMPI scores could be utilized to 
identify Ss who might have answered the Taylor MAS in an invalid manner. Ac- 
cordingly, Ss were eliminated whose L, F, or K scores on the MMPI were over 70. 
This left 21 Ss. For these 21 Ss the correlation between incongruency and Taylor 
MAS was .54, which is significant at the .01 level on a one-tailed test. 


DISCUSSION 


The studies described above, viewed in their totality, lend at least tentative 
support to the hypothesis of a positive relationship between anxiety and extent 
of incongruency between values and feelings. However, in view of the exper- 
imental difficulty encountered in Study 2—in clearly conveying instructions to 
the Ss, and in view of the largely negative results obtained in Study 3, it cannot 
be concluded that the existence of the predicted relationship has been un- 
equivocally established. Rather, the studies reported here should be viewed as 
preliminary experiments, and the need for further research is indicated. Such 
research should focus not only on improved measures of incongruency, but also 
on improved measures of anxiety. The general technique used for measuring 
incongruency in the present studies appears to have considerable potential. 
Further exploration with this technique will be aimed at the problems of item 
selection, reliability and clarity of instructions. 


We may examine further the relationship between anxiety and incon- 
gruency. The present theory holds that incongruencies of all kinds, provided 
they are not in areas in which the S avoids perceptions, are related to anxiety, 
whereas in the present studies we dealt only with incongruencies between values 
and feelings, and these only within a rather limited range. This selectivity was 
necessary for reasons of experimental expediency, but if the theory is valid, then 
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an examination of the literature might reveal other studies which can be re. 
interpreted within the context of the present view. 


For example, in a study of religious attitudes and manifest anxiety in col- 
lege students, Funk (5) found a correlation of .43 between anxiety and “religious 
conflict.” Since conflictive material can be interpreted as incongruent and un- 
assimilated, this finding clearly is in accord with the present theoretical view. 


Also pertinent to the present conceptualization of incongruency are studies 
which involve Ss using Q sorts in order to rate themselves, first, as to the way 
they see themselves and second, as to the way they would like to be. The corre- 
lation between these two sortings, often said to reflect degree of self-esteem or 
self-satisfaction, is directly a measure of a particular kind of “congruency,” and 
hence, according to the present view, should be negatively related to level of 
anxiety. In a study by Block and Thomas (1) there is reported a correlation of 
—.69 between this particular measure of congruency and the Pt scale of the 
MMPI. Inasmuch as Pt is highly correlated with the Taylor A-scale (2), this 
amounts to a substantiation of the present interpretation. 


Q-sort correlations between self and ideal ratings have been most used in 
studies conducted under the aegis of Carl Rogers. Rogers (10, 11) conceives of 
the degree of incongruency between self and ideal ratings—as reflected by low 
intercorrelations—as indicative of the degree of maladjustment. This interpreto- 
tion, however, has led to difficulties (1, 6), one of the reasons therefor being that 
there is evidence (4, 7) that paranoid schizophrenics do not have a great deol 
more incongruency than do normals.® Since it cannot be maintained that para- 
noid schizophrenics are well adjusted, it seems doubtful that this particular 
measure of incongruency—between self and ideal ratings—can be defended as 
a measure of maladjustment. It appears to me, however, that such a measure 
can be interpreted, even for paranoid schizophrenics, in accordance with the 
view that it reflects to some extent the level of unassimilated material, and hence 
of anxiety. This interpretation would imply that paranoid schizophrenics are not 
necessarily notably more anxious than normals, but it would not imply that they 
are almost as well adjusted as normals. One can suppose that paranoid schizo- 
phrenics have constructed peculiar conceptual systems which, though consensually 
invalid, permit them to assimilate material which for them would otherwise be 
unassimilable, thus keeping anxiety relatively low. And, of course, it is just this 
peculiar conceptual schema which connotes the paranoid condition. 


Friedman (4) did find evidence for what in the present context would be 
termed marked incongruency in paranoid schizophrenics when he obtained the 
correlations between their consciously stated self concepts and their inferred self 
concepts based on TAT stories. These correlations, as compared with normal Ss, 
were very low. It is to be noted, however, that this kind of incongruency, though 
it would imply abnormality, would not, according to the present theoretical view, 
necessarily imply anxiety. This is because the incongruency is based upon data 
which the S avoids the preception of, and so he would not build up a mass of 
unassimilated material. As pointed out earlier, incongruencies do not neces- 
sarily imply unassimilated material, but do so only to the extent that the in- 
dividual actually obtains percepts in the areas of incongruency. 


The present view of perceptual assimilation and incongruencies among 


percepts is in many ways quite similar to that of Rogers. Rogers (10, 11) has 
been interested in the extent to which incoming perceptual experiences are 


8Friedi 4) reported lations between self and ideal rati of .63, .03, and .43 for normals, 
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assimilated into an individual's concept of himself, and he conceives of percepts 
inconsistent with the self picture as posing threat and creating tension. Anxiety 
is related to the individual’s awareness of a discrepancy between his needs and 
his self picture. Rogers is consequently concerned with incongruencies between 
self and ideal perceptions, which is indicative of degree of maladjustment. The 
present view is that all kinds of incongruency among the variety of things one 
perceives—not just between self and ideal—help to determine the level of un- 
assimilated material, and that this level determines the extent of anxiety, whether 
or not the S is aware of the incongruencies per se. Further, it appears that we 
must keep quite separate the concepts of anxiety and maladjustment. 


SUMMARY 


It is proposed that anxiety is a function of the magnitude of perceptual 
material which an individual has not assimilated. It is further proposed that a 
major reason for percepts being unassimilated is the degree to which they are 
incongruent with each other, or with perceptual systems into which they would 
be assimilated. It follows that degree of anxiety should be positively related to 
the extent of incongruency within the personality in those areas in which an in- 
dividual does not avoid perceiving. Since it is not practicable to attempt to 
measure overall personality incongruency, it is necessary, in order to evaluate 
this hypothesis experimentally, to choose a specific kind of incongruency. This 
paper describes three studies designed to test the specific hypothesis that anxiety 
is related to incongruency between values and feelings. 


The Ss were asked to indicate for each of a number of items what their 
feelings (Like, Dislike) were and what their evaluations (Good, Bad) were. The 
extent of discrepancies between their answers was taken as a measure of incon- 
gruency. Anxiety was estimated by interviewer ratings, by self-ratings, and by 
the Taylor Manifest Anxiety Scale. Two of the studies used psychiatric patients as 
Ss, and the other used college students. 


Viewed in their totality, the findings are interpreted as tending to support 
the hypothesis, though not in an unequivoca! and conclusive manner. 
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A CONTROLLED EXPLORATORY INVESTIGATION INTO THE 
EFFECTS OF THORAZINE UPON MENTAL TEST SCORES 
OF CHRONIC HOSPITALIZED SCHIZOPHRENICS’ 


JOHN E. NICKOLS, JR. 
The Hertzler Clinic 


This project was designed primarily for the purpose of applying rigid scien- 
tific procedures to the task of investigating the basic effects of Thorazine upon 
mental efficiency. The problem area, mental efficiency, has been identified by 
Castner, Covington and Nickols (1). They proposed that Thorazine should be 
capable of lessening the disrupting effects of emotional disturbance, which in 
turn allows the S greater efficiency in setting his capacity to the task of dealing 
with immediate reality, particularly in relation to the less strictly intellectual func- 
tions. However, their conclusion was based upon the results of one small sample 
study that made no use of a control group. 


The hypothesis is that a study of placebo versus Thorazine treatments will 
yield no significant differences in mean gains of mental efficiency for hospitalized 
chronic schizophrenic patients. The independent variable can be defined as 
Thorazine treatments and the dependent variable as mental efficiency. Because 
mental efficiency is not readily sampled in terms of absolute measures, gains and 
losses in test scores between successive testings were used to estimate change in 
mental efficiency.” 


SUBJECTS AND PROCEDURES 


Initially, the Ss consisted of an accidental sample of 40 chronic, female 
schizophrenic patients of an lowa state hospital. Chronic wards were visited un- 
systematically and the first 40 suitable patients coming to the psychologist’s atten- 
tion were selected. Among other criteria of selection, all patients were believed 


‘This research was conducted at the Mental Health Institute, Independence, lowa, as a special-interest 
project under the approval and sponsorship of the lowa State Board of Control, particularly through 
the efforts of Board Member, Robert C. Lappan, and age Director-Advisor, Lowell Schenke. 

author is deeply indebted to his dissertation committee of Denver University: Lawrence Miller, 
Ph. D., Wilber Miller, Ph. D., Harry Moore, Ph. D., and particularly Stuart Boyd, Ph. D., who serv 
os faculty advisor, and Joel Green, Ph. D., who served as statistical advisor. He also wishes to 

wledge consultation conferences with Leigh Peck, Ph. D., University of Texas, and ur 
Benton, Ph. D., and John Martire, Ph. D., University of lowa. 


The author wishes to express his thanks to the many employees of the State of lowa who 
contributed to the progress of this research. J. O. Cromwell, M. D., Superintendent, approved and 
backed the research at the hospital level. Special acknowledgment is given to James Thomas for 
all administrations of the evaluative tests, his assistance in scoring the tests and assistance in super- 
vising personnel, and for directing much of the follow-u ee J in the author’s absence. Dr. J. 
Pietrzak and Dr. Henry Koseroski contributed their medical services. Substantial contributions were 

le by the Director of Nursing, Mrs. E. Kay, her female supervisory and psychiatric aide staff, 
and by J. Meuli, Coordinator of Special Activities, and his staff. Acknowledgment is also given to 
je ng student, Loyd Johnston, who administered the screening tests and assisted in the statistical 

yses. 


The author also wishes to thank the Smith, Kline and French Laboratories, which furnished all 
drugs, placebos and translations of foreign publications used for this project, and the secretarial 
of the Hertzler Clinic, Halstead, Kansas, which contributed typing services. 


‘The nature of “mental efficiency’’ is discussed elsewhere (3). In brief, it is used as a broader concept 
than intellectual efficiency,” allows for lowered reliability of most tests when used with chronic 
izophrenics, infers that measurement is stabilized upon some relatively fixed and pertinent standard 
enominator, and can be used clinically to imply the existence of a relationship between the in- 
dividual’s present performance level ona his actual mental capacity in coping with any task. 


to have been free from Thorazine treatments for at least six months. Distributions 
of age and durations of illness for the sample do not deviate significantly from 
those for resident schizophrenics of the hospital or total population in lowa state 
hospitals during 1951 to 1954. 


Two groups of 20 Ss were matched on a group basis for age and for initial 
raw scores on the Graham-Kendall Test and Ammons Full-Range Picture Vocabv. 
lary Test. Attempts were also made to match the two groups on (a) complicating 
medical diagnoses, such as diabetes and syphilis, (b) lobotomized patients, and 
(c) recent somatic treatments. Statistical analysis revealed satisfactory compari- 
sons on these and other variables. Other variables included (a) habitual level of 
psychomotor activity, (b) duration of hospitalization, (c) previous somatic treat. 
ment, (d) diagnoses, (e) intelligence quotients, and (f) time residing on the re- 
search ward prior to evaluative testing. 


Ss were divided into four sections, with each including relatively well 
matched sub-groups of five control and five experimental Ss. Each section was 
bedded at random in one of four partially partitioned dormitories of the 
research ward. 


The research ward was one of two newly and identically constructed wards 
to house chronic female patients. The two closed wards shared in common a 
large sitting and activities room. 


Ward conditions were highly controlled so as to approach a laboratory 
situation as nearly as possible in a state hospital setting. Two psychiatric aides, 
especially selected for the project, were given specific mimeographed instruc- 
tions. Privileges and activities were kept to a minimum, and those extended to 
any one S were also allowed to all 10 patients of her section. Activities included 
(a) some work on the ward, (b) television viewing and pastime activities in the 
common lounge, (c) short group walks and trips to the Canteen, (d) weekly 
dances and movies, and (e) up to two-hour week-end visits with relatives on the 
hospital grounds. Each S did not take up all activities. 


Treatment consisted of a double-blind-treatment method. Ward personnel 
did not realize that placebos were used until after all research was terminated. 
They were told that the project was intended to test the effects of two drugs. Tab- 
lets had identical markings, whereas almost all Thorazine ampules were marked 
with a red dot and each placebo ampule had a blue ring. Each S's individualized 
drug supply was marked only with that patient’s name, but personnel soon 
speculated that one drug was Thorazine and the other a different tranquilizer. 


For each section, separate treatment schedules were placed on the ward. 
Ss received injections for the first week, then were switched to oral dosages. 
Daily dosages ranged up to, and included, a maximum of 400 mg. for each S. 
Total dosages were approximately 8,000 mg. during the first four to five weeks 
and 15,900 mg. for the full nine week program. 


Gains in mental efficiency were measured primarily by weighted scores on 
the Wechsler-Bellevue Scales. These scales were administered before treatment, 
after four to five weeks, and after nine weeks of treatment (see Tables 2 and 3). 
Additional criteria tests, the Arthur Point Stencil Design Test, Form II°, and Ror- 
schach Test, were usually administered the afternoon of the same day on which 
any one S took the Wechsler-Bellevue Scales. Criterion testing was controlled 
for (a) examiner, (b) testing procedures, (c) fatigue, (d) physical environment, 


8The author could find no standardized time credits for this test. Time credits were calculated by 
use of the probable errors for time credits obtained by the 40 Ss on initial testing. 
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(e) immediate effects of Thorazine, (f) time and sequence of testing, and (g) 
familiarity with testing and test materials. Scorings were re-checked at least 
twice, using at least two qualified scorers. 


A screening battery of tests was administered by a practicum student. It 
consisted of (a) the Draw-A-Person Test, (b) Five Blot Test*, (c) Graham-Kendall 
Memory-For-Designs Test, (d) Ammons Full-Range Picture Vocabulary Test, and 
(e) Facial Expression Pictures Test.° The battery was used for (a) screening, elim- 
inating three untestable and three very deteriorated Ss, (b) matching, (c) con- 
trolling testing variables, and (d) speculations as to the effects of Thorazine treat- 
ments. Most of these tests were re-administered one to three days prior to re- 
testing with the evaluative criteria. 


RESULTS 


Thirty-six Ss completed the first four to five weeks of the study and 32 Ss 
completed the full nine weeks of treatment. One psychologist, a practicum stu- 
dent in psychology and two psychiatric aides were asked independently to rate 
patients according to specific standards of behavioral and psychiatric change 
during treatment. 


TABLE 1 
RESULTS OF PERSONNEL RATINGS FOR THE SYSTEMATIC STUDY* 


Average Percentages of Ss Percentage of Agreement 
Responding to Treatment** Between Judges *** 
Outstanding Only for Ss 
Improvement Improved Worse Showing Change _ For All Ss 
Psychological 
Ratings 0 Oo 27 «25 3.7 6 17 56 77 
Psychiatric 
Aides 17 17 54 56 0 0 42 33 72 67 
C = Control Group E = Experimental Group 


* None of the differences are statistically significant, as evaluated by small sample t-ratios 
for proportions. All averaged percentage values of 25 or more are significant from 
zero, to the five per cent level of confidence, and values of 42 or more significant to 
the one per cent level of confidence. 

** Each of these figures is based upon percentages of total patients and is expressed as an 
overall average for two judges who rated each S on psychiatric and behavioral im- 
provement. Ratings for the first four to five and full nine week treatments were 
almost identical for any one judge. 

*** These columns express the extent to which judges of each set agreed upon psychiatric and 
behavioral responses of individual Ss. Ratings for the first four to five and full nine 
week treatments were almost identical for any one judge. 

‘This test was specially compiled for this poled | by the author. It consists . 4, five acromatic photo- 


graphs of paint blots and administered like the perfo of Rorschach, to serve 
as a familiarizing ‘ter the 


“This test was devised by the author, who has been collecting unanalyzed data for five years, ‘and 
interpretations are made strictly from the more ‘Gassatieal clinical frame-of-reference. The test 


consists of twelve acromatic photographs of paint blots, vaguely touched up to resemble human 
or animal faces. For this research, each S was asked if each card (a) resembled most closely animal, 
wman, plant or object, and (b) appeared pleasant, unpleasant, a mixture, or neither. 


Table 1 shows low ratings,of improvement and very low agreement between 
judges for the psychological ratings of Ss improved. These raters were very cav- 
tious and observed the Ss for relatively short and scattered periods of time. The 
two psychiatric aides, who spent 40 hours per week on the research ward, show 
average judgments of approximately 50% improvement, including 17% out. 
standing improvement, for each group. For each group, ward personnel agree 
on at least two-thirds of their ratings and that at least one-third of each group 
improved. These ratings suggest that enough improvement occurred to allow 
psychological testing to sample the change, but they also suggest little differ- 
ences between the groups. 


TABLE 2 
RESULTS ON THE EVALUATIVE CRITERIA FOR VARIOUS DURATIONS AND PROGRAMS OF 
THORAZINE TREATMENTS (f-ratios of 1.00 or less are indicated by “’x’’) 


t for 
Program Mean Gains Mean Gain t for Uncorrelated Confidence 
Differences Mean Gains Mean Gain Levels for 
Differences Differences 
Cc E E—C E E—C E—C 
Wechsler-Bellevue Verbal Scale Gains (weighted scores) 
3.30 «1.5 —18 2.9 1.0 x 
i3 1606«(«1 0.3 x 1.1 x 
i4 49 23 —2.7 2.3 x x 
Wechsler-Bellevue Performance Scale Gains (weighted scores) 
i2 2.1 4.2 2.2 15 3.6 1.2 24 
44 74 3.0 47 48 1.7 10 
i4 54 8.5 3.1 2.9 48 1.2 .24 
Wechsler-Bellevue Full Scale Gains (weighted scores) 
i2 5.3 5.7 0.4 2.6 3.0 x 
60 9.3 3.3 2.4 3.3 x 
i4 10.3 10.8 0.4 3.6 2.8 x 


C = Control Group E = Experimental Group 


i2; N = 18 Results for the first four to five weeks of systematic treatments. Form II used on 
post-testing while Ss were on maintenance dosages of 100 mg. daily. 

i3: N = 16 Results for nine weeks of systematic treatments. Form | repeated after one week 

of no treatment. 

Results for nine weeks of systematic treatments plus 12 weeks follow-up treatments 

and activities. Form II repeated for post-testing after Ss were off Thorazine for 

at least one week. 


i4: N = 12 


Results of evaluations with standardized tests are presented in Tables 2 and 
3. Table 2 shows that for the first four to five weeks of treatments the null hypo- 
theses cannot be rejected significantly for any of the three Wechsler-Bellevue 
Scales. For the Verbal and Full Scales the results suggest that this situation is 
unaltered by a longer duration of treatment and re-testing without the immediate 
effects of Thorazine. Performance Scale results show that the Experimental 
Group had increased from a 2.2 to a 3.0 weighted score lead at the end of the 
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full nine weeks of treatments, and that the null hypothesis of no difference be- 
tween the groups can be rejected at the 10 per cent level of confidence. One 
may speculate that this difference would reach significance, should it be pos- 
sible to take into consideration the correlation due to matching groups. 


TABLE 3 
RESULTS OF TESTING WITH THE ARTHUR POINT STENCIL 
DESIGNS TEST, FORM II, BEFORE AND AFTER VARIOUS DURATIONS AND PROGRAMS 
OF THORAZINE TREATMENTS 


(t-ratios of 1.00 or less are indicated by “’x’’) 


t for 
Program Mean Gains Mean Gain t for Uncorrelated Probability 
Differences Mean Gains Mean Gain for Mean Gain 
Differences Differences 
Cc E E—C Cc E E--C E—C 


Arthur Point Stencil Designs Test, Form II (raw scores) 


i2 1.4 0.3 —1.1 18 x 12 .24 
i3 1.4 0.8 —0.6 22 17 x 
i4 1.4 0.0 —1.4 1.3 21 


i2 89 2.3 —6.7 2.6 1.1 Ls AB 

i3 11.5 48 —6.8 2.7 1.6 1.3 .20 

i4 10.6 2.8 —78 2.1 x 1.1 .28 
C = Control Group E = Experimental Group 


See Table 2 for interpretation of program symbols. 


Results of the Arthur Point Stencil Designs Test show no significant dif- 
ferences between the groups, but the results consistently favor the Control Group 
for the first four to five and full nine week programs. Time credits, improvised for 
the test, served only to accentuate the Control Group’s lead. This suggests that 
the probable action of Thorazine in facilitating improvement on Performance 
tasks is attributable to the nature of the tasks, or mental processes involved in 
coping with them, rather than to the time factor alone. It also suggests that ap- 
propriate use of the time factor with the more homogeneous tests should allow 
more effective use of item difficulty and an increase in precision of criteria 
iesting. 


EXTENDED STUDY 


An attempt was made to estimate the stability of the results by the use of 
a total-push treatment program, using most of the same Ss of each group. 
The present study was extended for an additional 12 weeks. This program in- 
cluded Thorazine and placebo treatments, in conjunction with (a) industrial 
therapy, (b) extended privileges, including ground passes for up to 11 ex- 
perimental and control Ss, and (c) twenty hours per week of activity therapy, 
including social, recreational, occupational, art and music therapies. Two psy- 
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chiatric aides had been asked to estimate daily dosages at which they had 
observed best behavioral responses while each S was participating in the sys. 
tematic study. Dosages averaged approximately 200 mg. daily for each group, 


Tables 2 and 3 indicate that results for the extended program are con. 
sistent with those for the first four to five weeks of the systematic study, and 
that no significant differences were found between the groups. The patterning 
of results for the three periods suggest that one should not overlook the pos. 
sibility that a large sample replication study of increased experimental preci- 
sion may show that Thorazine is better able to effect improvement of per- 
formance functions, but that the Control group may improve more on the 
Stencil designs and possibly on verbal functions. It is noteworthy that some 
mean gain differences for the total study (i4) are larger but less significant 
than those for the systematic study (i3); this may be due to the fact that the 
smaller sample did not allow the same statistical precision for the total study. 
The small test gains made during the longer and more intense follow-up study 
suggest the possibility that (a) repeated re-testing may have allowed the Ss to 
approach their improvement potential before participating in the total-push 
program, and/or (b) even though systematic and total-push treatment condi- 
tions are similarly capable of testing the effects of Thorazine, experimental pre- 
cision should be increased by using the total-push program as an independent 
study to parallel systematic conditions. 


RORSCHACH ANALYSIS 


The Rorschach test was administered before treatment, after the systematic 
study, and after the follow-up, total-push program. For initial reaction time 
the Experimental Group shows significant decreases to acromatic, .color, and 
all 10 cards for both treatment programs. Differences between the groups were 
significant for five of the six comparisons. These results suggest that Thorazine 
possesses the property of increasing spontaneity in chronic schizophrenics. 


Small sample t-ratios for proportions were used to evaluate changes in 17 
other scoring variables, for all Ss whose post-testing scores differed from their 
pre-testing scores. Probabilities for differences between the groups tend to 
distribute according to chance. That personality changes occurring during Thora- 
zine treatments are likely to be highly individualistic is suggested from the ob- 
servation that, while pre- and post-testing results differ markedly for the in- 
dividual S, there is a general lack of consistency in direction and magnitude of 
change among the scoring categories for the group as a whole. 


Directions of change on variables showing provisionally significant differ- 
ences (5 to 10 per cent levels of confidence) tend to correspond to tentative 
conclusions made previously by the author (1). Using one-half values for addi- 
tional responses, total numbers of responses tend to increase for experimental 
Ss, whereas sums for Y, color and popular responses tend to decrease. Although 
these trends were present for both the systematic and full 21 week treatment, 
only sum Y decreases were provisionally significant for both programs. These 
trends suggest that Thorazine helps lower the impact of emotional stress or 
threat and alleviates emotional disturbances or responsiveness, particularly in 
relation to external stimulation. FC decreases account for most of the decrease 
in color response, whereas Fe responses do not tend to change. This suggests 
that alleviation of emotional discomfort will not likely be accompanied by the 
development of more meaningful or satisfying affective or emotional relation- 
ships with social reality. On the contrary, the decrease in popular responses 
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suggests that conformity had become less important after treatment, and a gen- 
eral lack of change in human responses suggests no increase of interest in 
others. The increase in total number of responses suggests that the Ss ex- 
perienced no loss of mental drive. No significant changes were found for ap- 
proach and content. 


ADDITIONAL COMPARISONS 


Results of the systematic and follow-up programs of the present study were 
also compared with results of the Preliminary (Rusk) Study (1). This permits 
further speculation on (a) the stability of the results, and (b) identifying the 
possible weaknesses of the present study. 


The Preliminary Study consisted of four and one-half weeks of Thorazine 
treatment and of total-push treatment efforts. Facilitating efforts included di- 
dactic group psychotherapy, extended privileges, recreation and social thera- 
pies, and some industrial and occupational therapy. In addition, 5 of the 10 
Ss received two weeks of only recreational and social therapies and an addi- 
tional five weeks of Thorazine treatment. No control group was used. 


TABLE 4 
A COMPARISON OF THE PRESENT STUDY WITH THE PRELIMINARY STUDY 
FOR RESULTS OF WECHSLER-BELLEVUE SCALES WITH VOCABULARY AND OBJECT 
ASSEMBLY TESTS OMITTED 
(t-ratios of 1.00 or less are indicated by “x’’) 


Mean Weighted t for Mean Gain Confidence Levels 
Program Score Differences Uncorrelated Differences for Differences 
R—C R—C R—C 
Full 
Scale i2 10.7 11.7 3.3 3.9 01 01 
Weighted i3 99 8.0 3.0 2.0 01 06 
Score i4 7.6 69 19 1.6 .07 13 
Gains 
Verbal 
Scale i2 1.3 2.8 x 1.2 — .24 
Weighted i3 2.4 19 x x 
Score i4 —0.1 1.7 x x 
Gains 
Performance 
Scale i2 9.5 8.8 5.3 5.6 01 01 
Weighted i3 7.5 6.2 49 2.1 01 05 
a i4 7.7 5.2 3.8 2.7 01 02 
ins 


R: Results of the preliminary study, or Rusk Group. (No control group was used.) 
C: Results of the control group of the present, or Independence study. 
E: Results of the experimental group of the present, or Independence study. 


See Table 2 for interpretation of program symbols. 
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These comparisons, presented in Table 4, show no significant differences 
between the Preliminary Group and either group of the present study for 
Wechsler-Bellevue Verbal Scale gains, whereas the Preliminary Group did sig- 
nificantly better than either Independence group on Performance Scale gains 
for all three programs. This suggests that at least the Performance Scale results 
of the present study may be duplicated in other Thorazine-centered programs, 
With the possible exceptions of the initial use of a total-push treatment and 
use of 12 sessions of didactic group psychotherapy, the less prolonged and no 
more intense preliminary program suggests that S variables account for its 
superior results over the present study on Performance and Full Scale gains. 
Probable S variables are that the Preliminary Study included substantially 
greater percentages of Ss who (a) had not previously received any form of 
tranquilizing drugs, (b) had been hospitalized for less than one year, (c) were 
active and overactive, psychomotorly, and (d) were acutely or emotionally 
disturbed. However, these variables may also indicate favorable prognoses 
for any psychiatric treatment. 


Results for all individual tests and Wechsler-Bellevue sub-tests used in this 
study were inspected for statistical significance and consistencies of gains. 
Control Ss improved significantly more on the Full-Range Picture Vocabulary 
Test. They tended to improve substantially more on the Arthur Point Stencil De- 
signs, Test Form Il, and on the Arithmetic Test. Experimental Ss did significantly 
better on affect perception of the Facial Expression Pictures, or in attributing 
affectual meanings in common with 20 senior nursing students, and on the Ob- 
ject Assembly Test. They tended to improve substantially better on the Picture 
Arrangement Test and the Digit Symbol Test. 


Pearson product-moment correlations were run between each of four S$ 
variables and the three Wechsler-Bellevue Scales, for the gains made during 
the systematic and systematic plus total-push treatments. Rho correlations were 
also computed with composite gain scores (Wechsler-Bellevue Full Scale plus 
Stencil Designs plus one-fourth Full Range minus one-fourth Graham-Kendall 
score gains). 


Twenty-two comparisons were made for each of four S variables. The num- 
ber of significant differences between the groups tend to distribute within 
chance expectancies, but certain precautions may be suggested for precise re- 
search. For 18 of the 22 comparisons age correlated negatively, and deteriora- 
tion (initial Graham-Kendall scores) positively, with improvements in mental 
efficiency, but use of only Ss between 25 and 50 yr. tended to erase the age 
trend. Duration of hospitalization does not seem to be related to therapeutic 
improvement, except where durations are of less than four, particularly of less 
than two, years. Initial Full-Range score comparisons were inconsistent. 


Five rank order correlations were computed to compare each initial stand- 
ardized criterion test and each Rorschach variable with composite mental 
efficiency gains. No significant correlations or differences were found between 
therapeutic gains, on the one hand, and initial Stencil Designs scores, with or 
without time credits, initial Full Scale, Verbal or Performance 1.Q.’s, on the 
other hand. For 25 Rorschach variables, significant differences tended to dis- 
tribute according to chance, but future research might give principal considera- 
tion to the study of initial reaction time, sum Y responses, particularly C’ re- 
sponses, and the color ratio. Other variables may include whole responses, FC 
responses, and possibly Dd and total number of responses. 
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For the reader who wishes to speculate on prognostic indicators, which may 
be of value in selecting Ss specifically for Thorazine treatments, Rho correla- 
tions between initial Rorschach variables and composite score gains suggest 
that better therapeutic responses may be associated with initially higher sum 
W, sum Y and sum C’ responses, and initially lower color-ratios. 


SUMMARY AND DISCUSSION 


This project was designed as a matched group investigation into the effects 
of Thorazine upon the mental efficiency of chronic, hospitalized schizophrenics. 
Sixteen experimental and control Ss completed nine weeks of treatment under 
laboratory-like conditions, and 12 Ss of each group completed an additional 
12 weeks of total-push treatment conditions. Statistical analyses of mean gains 
do not allow significant rejection of the null hypotheses for Wechsler-Bellevue 
Verbal, Performance, or Full Scales Test and Arthur Point Stencil Designs Test. 
This suggests that Thorazine is incapable of producing clear-cut and significantly 
measurable effects on mental efficiency, for the Ss, experimental precision, and 
conditions employed in this study. Rorschach evaluations and personnel ratings 
also failed to show significant evidence that the effects of Thorazine differ sub- 
stantially from those obtained with placebos under the same therapeutic condi- 
tions. 

Several factors suggest that, in order to interpret safely the essentially 
negative character of the results, the implications of several trends in the results 
of the present study should be ruled out in future research, by use of large 
sample replication studies of increased experimental precision. Possible weak- 
nesses of this study have been discussed in the text. 


Inspection of trends of change among the results allow several speculations 
which should be followed up in future research. Thorazine treatments seem to 
be consistently associated with markedly improved efficiency in coping with 
performance tasks; but the drug seems to be ineffective, if not disadvantageous 
for improving performances to certain verbal or the more strictly intellectual 
functions. Projective test evaluations suggest that Thorazine may allow increased 
spontaneity and alleviate emotional responsiveness and experiences of stress 
reactions to external stimulation, which do not seem to be accompanied by 
improvement in the basic illness, mental content, or meaningful personal re- 
lationships with the environment. 


The overall pattern of results suggests that the drug may effect favorable 
influences upon disorders that are due primarily to the disrupting effects of 
emotional disturbances upon mental efficiency and use of previously established 
habit-formations and behavioral control, particularly where the pathological 
reaction-sequence is secondary to and/or serves to alleviate emotional dis- 
comfort aroused by external stress-provoking stimuli, but impotent or possibly 
resistant to helping disorders that are due primarily to aberrations of mental 
content or the more strictly intellectual functions. These speculations suggest 
that future research should investigate the possibility that Thorazine is effective 
with basic emotional and social disorders, but not necessarily and possibly 
disadvantageous with basic mental and intellectual disorders. More specific 
speculations can be extracted from the text of this paper. 
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PERSPECTIVES IN PSYCHOLOGY 


Vil. THE CRITERIA OF VALIDITY IN OBSERVATIONAL ANALYSIS’ 


PAUL SWARTZ 
University of Wichita 


The main body of psychology started its career by putting 
the wrong foot forward and it has been out of step with 
the march of science much of the time since. Instead of 
beginning with studies of the whole person adjusting to 
a natural social environment, it began with studies of a 
segment of a person responding to a physical stimulus in 
an unnatural laboratory environment. Consequently, after 
a century of diligent application, psychologists still lack 
sufficient ordered knowledge of everyday social behavior. 
Their attempts to overcome this handicap by adding or as- 
sociating in some way the psychological processes which 
have been scientifically investigated in the laboratory have 
not been notably successful. 

—Assessment of Men. 


Several recent commentaries on the current situation in psychology have 
raised questions regarding the validity of the experimental approach to be- 
havior. The suggestion is made that experimentation has been overstressed in 
psychological work and that the possibilty of making greater use of observa- 
tional techniques should be carefully explored. In an especially frank and 
perceptive formulation of this argument, Paul Lazarsfeld (7) has expressed 
doubt that such complex processes as voting or buying behavior are amenable 
to experimental analysis. “You can experiment with how people bet,” suggests 
lazarsfeld. “But if you want to know how people vote, and how people buy, 
| don’t think you can really experiment. You have to do systematic analytical 
observations.” 


The present writer has stated the issue in more general terms. 


Simple behaviors, activities in which physiological processes 
predominate, behaviors coordinated with minimal stimulation 
and stimulus deprivation—with these forms of action a valid 
experimental psychology seems not only possible but dis- 
tinctly probable. But what of the more complex behaviors, 
in which conduct is coordinated with very complex patterns 
of stimulation and in which the factors of past behavior 
history and social conditioning are of paramount importance? 
What is the outlook here (12, p. 122)? 


Part of this paper was ted at the annual meeting of the Kansas Psychological Association in 
Wichita, Kansas, on ‘April 19, 1958. ” 


Tentatively, the position is taken that while “we must not give up experi- 
menting with complex behaviors,” provision should be made for the possibility 
that “the really significant ‘break-throughs’ ” in understanding may result from 
“the perceptive observations of the non-experimentalist ‘amateur’ and _non- 
experimentalist professional,” rather than from controlled laboratory study. 


Observational analyses of human behavior are apt to be regarded with 
scornful disapproval by the defenders of orthodoxy in psychology and are liable 
to the charge of anecdotalism. “There is an Idol of the Laboratory,” writes 
Joseph Wood Krutch, “as well as of the Market Place” (5). Intolerant and 
unenlightened as this attitude may appear, there is considerable justification 
for it in the continuing, centuries-long struggle to establish psychology as a 
science. The observational approach is not susceptible of such precise definition 
or circumscription as the experimental approach. Neither is it as reliably self. 
corrective. It can easily degenerate into a fruitless, paralyzing rationalism. If 
the lessons of history are to be our guide, then the burden of proof clearly rests 
on those who propose the observational approach and not on their critics. 
“The disillusion with the laboratory” (4), must not be allowed to usher in the 
return of subjectivism or prelude the surrender of scientific method. The in- 
tent of the observational approach is to broaden the scope and significance of 
psychological inquiry, not to emasculate it. 


The question which must be answered, and answered well, by the observa- 
tional psychologist is the question of criteria. What standards exist for judging 
the validity of conclusions based on observational findings? Are the traditional 
criteria of prediction and control applicable in observational analysis? Or are 
there other standards to invoke? In the present paper we propose to make a 
preliminary examination of this problem. No solutions are offered; the prob- 
lem admits of no easy solution. We have only a few suggestions to make, 
which we hope will help clarify the problem and bring certain aspects of it 
into sharper focus. 


THE CRITERIA OF VALIDITY IN OBSERVATIONAL ANALYSIS 


The observational approach in psychological work may assume several dif- 
ferent forms. For our purposes we can distinguish (a) systematic analytical ob- 
servation, as proposed by Lazarsfeld (7), (b) clinical observation, and (c) non- 
systematic, reflective observation, as embodied in the art of the novelist or 
playright and in the work of the professional psychologist who prefers a non- 
systematic, natural behavior-world type of study. Each of these three observa- 
tional methods incorporates certain unique features. It is therefore necessary 
to consider them singly in discussing the question of criteria of validity. 


Systematic Analytical Observation 


The systematic analytical observation of behavior is well described by 
Lazarsfeld in a brief reference to some of his own work. 


Let me come back once more to the type of voting studies | 
am especially interested in. We take a sample of people in, 
say, Elmira, New York, and interview them every month. We 
ask them for whom they want to vote, and then see how they 
change from month to month. We have done that with car 
buyers; we have kept potential car buyers under observation 
for nine months and have tried to study the decision process 
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by re-interviewing the same people in various stages of their 
approach to a final purchase. 


Many such decisions can best be studied by making re- 
peated observations and seeing how the outcome at one 
point affects the turn of affairs in the next time period. We 
have data on disc jockeys . . . . How do they decide which 
record to play next? We made very detailed observation 
with people sitting beside those disc jockeys and watching 
how they pick the record, and then we interviewed those disc 
jockeys. We can tell what records they play next as a result 
of what telephone calls come in. . . (7, pp. 102-103). 


Of the three forms of observational study that we have distinguished, it 
is obviously systematic analytical observation that most nearly resembles ex- 
perimental research. By the careful selection of behavior samples and the 
skillful design and standardization of procedures, the controlled conditions of 
study characteristic of the laboratory may be significantly approximated by 
the psychologist who uses this method. Furthermore, systematic analytical ob- 
servation lends itself to quantitative treatment. Lazarsfeld, for example, in 
connection with his own work, suggests the relevancy of time series analysis. 
“The models to be looked for,” he writes, “have a similarity to business cycle 
analysis of the economists” (7, p. 103). 


It should be readily apparent that the determination of the validity of con- 
clusions about behavior derived from systematic analytical observation does 
not present a special problem. The criterion of prediction is clearly applicable. 
We have already cited one example of applying this standard in describing 
lazarsfeld’s disc jockey study. Successful prediction based on conclusions de- 
rived from observational analysis increases the likelihood of their being valid. 
Unsuccessful prediction casts doubt on their validity. 


Clinical Observation 


The second type of observational method that we have distinguished is 
clinical observation. Psychologists, psychoanalysts, and psychiatrists have all 
paid considerable attention to the question of validating knowledge derived 
in the clinical situation. The problems here are very serious and very complex. 
A good, representative introduction to the basic issues involved can be found 
in the 1950 Hixon Lectures on Psychoanalysis as Science (11). Mutatis mutandis, 
the observations made by the participants in this series can be applied to any 
attempt to validate clinical knowledge, regardless of the particular theoretical 
orientation of the clinical investigator. 


The viewpoints expressed by the participants in the Hixon Lectures are 
sufficiently diverse and well-developed to provide a basis for our discussion. 
Hilgard (3), of course, upholds the possibility of controlled, laboratory valida- 
tion of psychoanalytic concepts and techniques. Although he recognizes that 
“so many of the experiments give merely trivial illustrations of what psycho- 
analysts have demonstrated to their own satisfaction in clinical work,” and 
that “experimental work thus far bears most directly only on the most superfi- 
cial aspects of psychoanalytic theory, while many of its deeper problems are 
scarcely touched,” nevertheless, he stoutly maintains that “if we are able to 
design experiments appropriate to the more superficial aspects, we can move 
on to deeper stages.” He cautions, however, that “laboratory experimentation 
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that preserves anything like the richness of a psychoanalysis will be very diffi. 
cult indeed, if not, perhaps, impossible.” 


Hilgard’s optimism regarding an experimental approach to psychoanalysis 
is not shared by E. Pumpian-Mindlin, himself a psychoanalyst. In response to the 
question: “Are, then, psychoanalytic data and theory verifiable in the laboro. 
tory?” he writes: 

| am afraid that we must face the fact that they are not sub- 
ject to direct verification in this manner at the present time. 
Derivative data, such as the mechanisms of defense or the 
presence of conflicts and of unconscious forces, may be veri- 
fied, but the detailed derivation of these factors in the ana- 
lytic sense at present cannot be subjected to any type of 
experimental validation, except by the use of the analytical 
method itself. | do not underestimate the type of experimen- 
tation and clinical validation offered by Dr. Hilgard in his 
lectures . . . but the extension of this material beyond these 
limits is not yet feasible (10, p. 155). 


This does not mean, continues Pumpian-Mindlin, that the theory and data of 
psychoanalysis cannot at all be validated. While the results will not satisfy 
the stringent criteria employed in the biological and physical sciences, “it is 
possible,” he writes, 


to obtain objective evidence of at least certain general 
hypotheses of psychoanalysis. Certainly it is possible to in- 
vestigate in the laboratory the nature and quality of the 
underlying forces present in human beings, as to a certain 
extent has already been done (p. 155). 


The subject matter of psychoanalysis, argues Pumpian-Mindlin, is “on a 
different integrative level” than that of the “exact sciences.” The definiteness 
and specificity demanded of hypotheses and principles in these sciences can- 
not be expected in psychoanalysis. Psychoanalytic observation cannot be as 
highly controlled as investigation in the physical and biological disciplines. At 
present, it is concluded, psychoanalysis must be content with “establishing 
what appear to be significant, but not exclusive, correlations rather than spe- 
cific causal relationships.” 


To this writer the most cogent analysis presented in the Hixon Lectures is 
that formulated by Lawrence S. Kubie (6). His argument is brilliantly conceived 
and masterfully executed. No review can hope to do it justice. It must be 
read in toto. All that we will attempt to do in this paper is to underscore a 
few of its major points. 


The experimental scientist, writes Dr. Kubie, does not at this time know 
enough about psychoanalysis to determine what questions are appropriate and 
significant. The first task of the experimentalist, he observes, is to: 


make himself thoroughly familiar with phenomena as these 
occur in nature, ascertaining what can be proved with 
the unaided eye and ear before deciding what to subject to 
experimental verification. Otherwise, investigators may use 
complex methods to prove something which needs no prov- 
ing, precisely because it is on the surface for all to observe, 
either during the naive phases of childhood or in the facts 
of illness which are familiar to the clinician (p. 64). 
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laboratory validation of such data, continues Kubie, is redundant, having 
mainly an educational value. Of course, he adds, the precision and refinement 
of data which the laboratory makes possible is not superfluous. 


My argument is solely against the uselessness of making 
pallid facsimiles in the laboratory of data which are already 
manifest in nature, merely to get around the human reluc- 
tance to look human nature in the eye. This comment is 
relevant to no small part of the experimental work that has 
been documented by Sears . . . , some of which was referred 
to by Professor Hilgard. Many of these laboratory charades 
are pedestrian and limited demonstrations of things which 
have been proved over and over again in real life... . 
Experimental facilities should not be wasted on issues which 
are already clearly proved, and to which human bias alone 
continues to blind us. The experimentalist should rather take 
up where the naturalist leaves off (pp. 64-65). 


In this lucid and courageous exposition of one of the fundamental prin- 
ciples of scientific work, Kubie has pointed unerringly to the major criticism 
that the observational student of behavior makes of traditional experimentation 
in psychology, namely, its almost studied neglect of events in the natural 
behavior-world. How much of our so-called basic research in psychology pales 
beside the richness and variety of real life activity! How much is but the con- 
trolled investigation of laboratory fictions! Lazarsfeld has deplored this self- 
imposed limitation on psychological work in a brief, but penetrating discussion 
of the data of science. He writes: 


Experiments might be the only source of data for the physicist 
and they might be useful for the social scientist. But they 
are for us certainly not the main material we are working 
with today. Don’t forget that we have data for something like 
2,500 years. Plato has developed very interesting proposi- 
tions on human behavior in society and, if we now come and 
develop models which somehow organize or clarify what 
Plato has to say, we are doing an important job. It isn’t true 
that the only data in the social sciences are those which have 
been collected in the laboratories in the last 50 years (7, p. 
99). 

There is one apparent weakness in Kubie’s analysis which the present writer 
readily acknowledges. How does one distinguish between data which require 
no experimental verification because they are “already manifest in nature,” 
and data which appear to be but are not really manifest in nature? How, in 
other words, does the naturalist avoid the error of making “the world is flat’ 
type of assertion? 


The problem is not insoluble, although the solution is not fool-proof. We 
assume, first of all, that our naturalistic investigator, like the cultural anthro- 
pologist, is a well-trained student of behavior. Second, we assume that only 
those data concerning which a reasonable consensus can be reached will be 
accepted as “manifest in nature.” Third, we assume that where the results of 
naturalistic observation contradict the already accepted body of knowledge, 
more complex methods of verification, not necessarily experimental in nature, 
will be applied. Will these safeguards eliminate all error? Certainly not. But 
no science has or ever will devise an error-free investigative technique. If there 


is any possibility of advancing our understanding of human behavior by nat. 
uralistic observation, we should certainly be willing to bear the risks involved, 
recognizing, of course, that this is only one phase of the investigative operation, 
Hilgard expresses some appreciation of the usefulness of naturalistic study when 
he writes: “If we are wise enough to make the most of naturalistic observations, 
we may find some clues that will help us to design better experiments” (3, p. 35), 

“The major contribution of experimental science,” contends Kubie, “will not 
be limited to confirming psychoanalytic observations in the laboratory but 
rather to providing psychoanalysis with instruments of greater qualitative and 
quantitative precision.” The first problem to be solved is how to record ade- 
quately the therapeutic process “without at the same time distorting the process." 
Kubie proposes an institute for research in psychoanalytic psychology, an or. 
ganization which would coordinate the work of many disciplines: neuro- 
physiology, biophysics, biochemistry, pharmacology, clinical psychology, cul- 
tural anthropology, biostatistics and, of course, psychoanalysis. He calls such 
an institute “a dream for the future.” 

In our limited judgment Kubie has correctly stated the relationship between 
clinical observation and controlled experimentation, and has argued con- 
vincingly for the validity of the former as a means of studying behavior. We 
would add only one step to his program, namely, a sociological analysis of 
the general and specific cultural circumstances in which the clinical knowledge 
is derived. Pumpian-Mindlin hints at the necessity for this when he calls at- 
tention to the fact that Freud “arose in a definite historical period and was 
subject to its influences.” If we accept the relationist thesis of the sociology of 
knowledge, viz., “that it lies in the nature of certain assertions that they cannot 
be formulated absolutely, but only in terms of the perspective of a given 
situation,” and if we assume that assertions about human behavior are mainly 
of this type, then it logically follows that the first step in determining the 
validity of psychological knowledge must be “an analysis based on the sociology 
of knowledge” (9). 


Non-systematic, Reflective Observation 


We come now to the third type of observational analysis, namely, non- 
systematic, reflective observation, as embodied in the art of the novelist or 
playright and in the work of the professional psychologist who prefers a non- 
systematic, natural behavior-world type of study. Here we are on dangerous 
ground. Judged by the standards of modern, experimental psychology, this 
approach to behavior is easily the least respectable of all the observational 
methods. Even though many psychologists would agree with Verplanck (13), 
that novelists and playrights “do a remarkably good job of giving plausible ac- 
counts of behavior, often in terms that seem pertinent,” nevertheless, these same 
psychologists would probably acquiesce in the prevailing opinion that “literary 
psychology” is an alien rite in the House of Science. Allport’s plea (1) for 
pluralism in the methods of psychology, as expressed in his belief that “what- 
ever contributes to a knowledge of human nature, is an admissible method to 
yw dl seems not to have had much impact on traditional psychological 

ing. 

The psychologist’s refusal to entertain seriously the possibility that literary 
creation can make a significant contribution to the understanding of human 
behavior rests, we believe, in what Karl Mannheim (9) would call “a one-sided 
and narrow orientation to the problem of knowledge.” Consider the following 
observation, relative to the question of scientific vs. so-called pre-scientific 
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knowledge and understanding. It appears in Mannheim’s classic work, Ideology 
and Utopia. Compare it to the typical “holier-than-thou” attitude of the ex- 
perimental psychologist. We believe that the program suggested is an emi- 
nently sane and practical one, which might even have as a beneficial secondary 
effect, the re-establishment and strengthening of the psychologist’s contact with 
real life behavior. 


If, however, it is true, that life affords possibilities of 
knowledge and understanding even where science plays no 
part, it is no solution to designate such knowledge as “‘pre- 
scientific’ or to relegate it to the sphere of “intuition,” 
simply in order to preserve the purity of an arbitrary defi- 
nition of “science.” On the contrary, it is above all our duty to 
inquire into the inner nature of these still unformulated types 
of knowledge and then to learn whether the horizons and 
conceptions of science cannot be so extended as to include 
these ostensibly pre-scientific areas of knowledge (p. 165). 


The problem of validating knowledge derived from non-systematic, re- 
flective observation can be best handled in two parts, corresponding to the 
two extreme instances of this form of knowledge. There is, first of all, knowledge 
derived from non-systematic, reflective observation that is confirmed in sys- 
tematic analytical observation, clinical observation, or experimental analysis. 
This confirmation may be either intentional or accidental, witting or unwitting. 
Assuming that we can eventually agree on a set of criteria for the validity of 
clinically derived knowledge, such confirmation would seem to provide pre- 
sumptive support for the validity of the non-systematically, reflectively derived 
knowledge. 


One of the best examples of this manner of validating knowledge can be 
found in the frequent clinical confirmation of ambivalence as an important 
characteristic of affective behavior. Freud (2) attributes the coining of this term 
to Bleuler. Yet, but for the naming of the behavior, the ambivalent reaction was 
described with great insight over two thousand years ago by Lucretius (8), the 
celebrated Roman poet. In Book IV of his poem, The Nature of the Universe, in 
which Lucretius discusses sensation and sex, he writes: 


Do not think that by avoiding grand passions you are 
missing the delights of Venus. Rather, you are reaping such 
profits as carry with them no penalty. Rest assured that this 
pleasure is enjoyed in a purer form by the healthy than by 
the love-sick. Lovers’ passion is storm-tossed, even in the 
moment of fruition, by waves of delusion and incertitude. 
They cannot make up their mind what to enjoy first with eye 
or hand. They clasp the object of their longing so tightly that 
the embrace is painful. They kiss so fiercely that teeth are 
driven into lips. All this because their pleasure is not pure, 
but they are goaded by an underlying impulse to hurt the 
thing, whatever it may be, that gives rise to these budding 
shoots of madness (pp. 163-164). 

This example is but one of many that could be cited in which non-sys- 


tematically, reflectively derived knowledge, through clinical or experimental 
confirmation, clearly makes a contribution to psyc | understanding. 


Two thousand years, we would all agree, is a long time to wait for clinical 
confirmation of a naturalistic observation. Surely, there must be some alterna- 
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that preserves anything like the richness of a psychoanalysis will be very diffi. 
cult indeed, if not, perhaps, impossible.” 


Hilgard’s optimism regarding an experimental approach to psychoanalysis 
is not shared by E. Pumpian-Mindlin, himself a psychoanalyst. In response to the 
question: “Are, then, psychoanalytic data and theory verifiable in the labora. 
tory?” he writes: 

| am afraid that we must face the fact that they are not sub- 
ject to direct verification in this manner at the present time. 
Derivative data, such as the mechanisms of defense or the 
presence of conflicts and of unconscious forces, may be veri- 
fied, but the detailed derivation of these factors in the ana- 
lytic sense at present cannot be subjected to any type of 
experimental validation, except by the use of the analytical 
method itself. | do not underestimate the type of experimen- 
tation and clinical validation offered by Dr. Hilgard in his 
lectures . . . but the extension of this material beyond these 
limits is not yet feasible (10, p. 155). 


This does not mean, continues Pumpian-Mindlin, that the theory and data of 
psychoanalysis cannot at all be validated. While the results will not satisfy 
the stringent criteria employed in the biological and physical sciences, “it is 
possible,” he writes, 


to obtain objective evidence of at least certain general 
hypotheses of psychoanalysis. Certainly it is possible to in- 
vestigate in the laboratory the nature and quality of the 
underlying forces present in human beings, as to a certain 
extent has already been done (p. 155). 


The subject matter of psychoanalysis, argues Pumpian-Mindlin, is “on a 
different integrative ievel” than that of the “exact sciences.” The definiteness 
and specificity demanded of hypotheses and principles in these sciences can- 
not be expected in psychoanalysis. Psychoanalytic observation cannot be as 
highly controlled as investigation in the physical and biological disciplines. At 
present, it is concluded, psychoanalysis must be content with “establishing 
what appear to be significant, but not exclusive, correlations rather than spe- 
cific causal relationships.” 


To this writer the most cogent analysis presented in the Hixon Lectures is 
that formulated by Lawrence S. Kubie (6). His argument is brilliantly conceived 
and masterfully executed. No review can hope to do it justice. It must be 
read in toto. All that we will attempt to do in this paper is to underscore a 
few of its major points. 


The experimental scientist, writes Dr. Kubie, does not at this time know 
enough about psychoanalysis to determine what questions are appropriate and 
significant. The first task of the experimentalist, he observes, is to: 


make himself thoroughly familiar with phenomeno as these 
occur in nature, ascertaining what can be proved with 
the unaided eye and ear before deciding what to subject to 
experimental verification. Otherwise, investigators may use 
complex methods to prove something which needs no prov- 
ing, precisely because it is on the surface for all to observe, 
either during the naive phases of childhood or in the facts 
of illness which are familiar to the clinician (p. 64). 
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Laboratory validation of such data, continues Kubie, is redundant, having 
mainly an educational value. Of course, he adds, the precision and refinement 
of data which the laboratory makes possible is not superfluous. 


My argument is solely against the uselessness of making 
pallid facsimiles in the laboratory of data which are already 
manifest in nature, merely to get around the human reluc- 
tance to look human nature in the eye. This comment is 
relevant to no small part of the experimental work that has 
been documented by Sears . . . , some of which was referred 
to by Professor Hilgard. Many of these laboratory charades 
are pedestrian and limited demonstrations of things which 
have been proved over and over again in real life... . 
Experimental facilities should not be wasted on issues which 
are already clearly proved, and to which human bias alone 
continues to blind us. The experimentalist should rather take 
up where the naturalist leaves off (pp. 64-65). 


In this lucid and courageous exposition of one of the fundamental prin- 
ciples of scientific work, Kubie has pointed unerringly to the major criticism 
that the observational student of behavior makes of traditional experimentation 
in psychology, namely, its almost studied neglect of events in the natural 
behavior-world. How much of our so-called basic research in psychology pales 
beside the richness and variety of real life activity! How much is but the con- 
trolled investigation of laboratory fictions! Lazarsfeld has deplored this self- 
imposed limitation on psychological work in a brief, but penetrating discussion 
of the data of science. He writes: 


Experiments might be the only source of data for the physicist 
and they might be useful for the social scientist. But they 
are for us certainly not the main material we are working 
with today. Don’t forget that we have data for something like 
2,500 years. Plato has developed very interesting proposi- 
tions on human behavior in society and, if we now come and 
develop models which somehow organize or clarify what 
Plato has to say, we are doing an important job. It isn’t true 
that the only data in the social sciences are those which have 
been collected in the laboratories in the last 50 years (7, p. 
99). 

There is one apparent weakness in Kubie’s analysis which the present writer 
readily acknowledges. How does one distinguish between data which require 
no experimental verification because they are “already manifest in nature,” 
and data which appear to be but are not really manifest in nature? How, in 
other words, does the naturalist avoid the error of making “the world is flat’ 
type of assertion? 


The problem is not insoluble, although the solution is not fool-proof. We 
assume, first of all, that our naturalistic investigator, like the cultural anthro- 
pologist, is a well-trained student of behavior. Second, we assume that only 
those data concerning which a reasonable consensus can be reached will be 
accepted as “manifest in nature.” Third, we assume that where the results of 
naturalistic observation contradict the already accepted body of knowledge, 
more complex methods of verification, not necessarily experimental in nature, 
will be applied. Will these safeguards eliminate all error? Certainly not. But 
no science has or ever will devise an error-free investigative technique. If there 
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is any possibility of advancing our understanding of human behavior by nat. 
uralistic observation, we should certainly be willing to bear the risks involved, 
recognizing, of course, that this is only one phase of the investigative operation, 
Hilgard expresses some appreciation of the usefulness of naturalistic study when 
he writes: “If we are wise enough to make the most of naturalistic observations, 
we may find some clues that will help us to design better experiments” (3, p. 35), 

“The major contribution of experimental science,” contends Kubie, “will not 
be limited to confirming psychoanalytic observations in the laboratory but 
rather to providing psychoanalysis with instruments of greater qualitative and 
quantitative precision.” The first problem to be solved is how to record ade- 
quately the therapeutic process “without at the same time distorting the process.” 
Kubie proposes an institute for research in psychoanalytic psychology, an or- 
ganization which would coordinate the work of many disciplines: neuro- 
physiology, biophysics, biochemistry, pharmacology, clinical psychology, cul- 
tural anthropology, biostatistics and, of course, psychoanalysis. He calls such 
an institute “a dream for the future.” 

In our limited judgment Kubie has correctly stated the relationship between 
clinical observation and controlled experimentation, and has argued con- 
vincingly for the validity of the former as a means of studying behavior. We 
would add only one step to his program, namely, a sociological analysis of 
the general and specific cultural circumstances in which the clinical knowledge 
is derived. Pumpian-Mindlin hints at the necessity for this when he calls at- 
tention to the fact that Freud “arose in a definite historical period and was 
subject to its influences.” If we accept the relationist thesis of the sociology of 
knowledge, viz., “that it lies in the nature of certain assertions that they cannot 
be formulated absolutely, but only in terms of the perspective of a given 
situation,” and if we assume that assertions about human behavior are mainly 
of this type, then it logically follows that the first step in determining the 
validity of psychological knowledge must be “an analysis based on the sociology 
of knowledge” (9). 


Non-systematic, Reflective Observation 


We come now to the third type of observational analysis, namely, non- 
systematic, reflective observation, as embodied in the art of the novelist or 
playright and in the work of the professional psychologist who prefers a non- 
systematic, natural behavior-world type of study. Here we are on dangerous 
ground. Judged by the standards of modern, experimental psychology, this 
approach to behavior is easily the least respectable of all the observational 
methods. Even though many psychologists would agree with Verplanck (13), 
that novelists and playrights “do a remarkably good job of giving plausible ac- 
counts of behavior, often in terms that seem pertinent,” nevertheless, these same 
psychologists would probably acquiesce in the prevailing opinion that “literary 
psychology” is an alien rite in the House of Science. Allport’s plea (1) for 
pluralism in the methods of psychology, as expressed in his belief that “what- 
ever contributes to a knowledge of human nature, is an admissible method to 
science,” seems not to have had much impact on traditional psychological 
thinking. 

The psychologist’s refusal to entertain seriously the possibility that literary 
creation can make a significant contribution to the understanding of human 
behavior rests, we believe, in what Karl Mannheim (9) would call “a one-sided 
and narrow orientation to. the problem of knowledge.” Consider the following 
observation, relative to the question of scientific vs. so-called pre-scientific 
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knowledge and understanding. It appears in Mannheim’s classic work, Ideology 
and Utopia. Compare it to the typical “holier-than-thou” attitude of the ex- 
perimental psychologist. We believe that the program suggested is an emi- 
nently sane and practical one, which might even have as a beneficial secondary 
effect, the re-establishment and strengthening of the psychologist’s contact with 
real life behavior. 


If, however, it is true, that life affords possibilities of 
knowledge and understanding even where science plays no 
part, it is no solution to designate such knowledge as “pre- 
scientific” or to relegate it to the sphere of “intuition,” 
simply in order to preserve the purity of an arbitrary defi- 
nition of “science.” On the contrary, it is above all our duty to 
inquire into the inner nature of these still unformulated types 
of knowledge and then to learn whether the horizons and 
conceptions of science cannot be so extended as to include 
these ostensibly pre-scientific areas of knowledge (p. 165). 


The problem of validating knowledge derived from non-systematic, re- 
flective observation can be best handled in two parts, corresponding to the 
two extreme instances of this form of knowledge. There is, first of all, knowledge 
derived from non-systematic, reflective observation that is confirmed in sys- 
tematic analytical observation, clinical observation, or experimental analysis. 
This confirmation may be either intentional or accidental, witting or unwitting. 
Assuming that we can eventually agree on a set of criteria for the validity of 
dinically derived knowledge, such confirmation would seem to provide pre- 
sumptive support for the validity of the non-systematically, reflectively derived 
knowledge. 


One of the best examples of this manner of validating knowledge can be 
found in the frequent clinical confirmation of ambivalence as an important 
characteristic of affective behavior. Freud (2) attributes the coining of this term 
to Bleuler. Yet, but for the naming of the behavior, the ambivalent reaction was 
described with great insight over two thousand years ago by Lucretius (8), the 
celebrated Roman poet. In Book IV of his poem, The Nature of the Universe, in 
which Lucretius discusses sensation and sex, he writes: 


Do not think that by avoiding grand passions you are 
missing the delights of Venus. Rather, you are reaping such 
profits as carry with them no penalty. Rest assured that this 
pleasure is enjoyed in a purer form by the healthy than by 
the love-sick. Lovers’ passion is storm-tossed, even in the 
moment of fruition, by waves of delusion and incertitude. 
They cannot make up their mind what to enjoy first with eye 
or hand. They clasp the object of their longing so tightly that 
the embrace is painful. They kiss so fiercely that teeth are 
driven into lips. All this because their pleasure is not pure, 
but they are goaded by an underlying impulse to hurt the 
thing, whatever it may be, that gives rise to these budding 
shoots of madness (pp. 163-164). 

This example is but one of many that could be cited in which non-sys- 


tematically, reflectively derived knowledge, through clinical or experimental 
confirmation, clearly makes a contribution to psychological understanding. 


Two thousand years, we would all agree, is a long time to wait for clinical 
confirmation of a naturalistic observation. Surely, there must be some alterna- 


tive solution to the problem of validity of non-systematic, reflective knowledge. 
The search for such a solution becomes imperative when we consider the second 
class of such knowledge, namely, non-systematically, reflectively derived knowl. 
edge which, either because it is too specific, as in the creation of a particular 
fictional character, or too complex, as in the fictionalized representation of an 
interpersonal relationship, cannot be validated in the laboratory, or in the 
clinic, or through systematic analytical observation. What is the status of this 
knowledge as seen from the scientific perspective of the psychologist? What 
tests of validity can be applied here? 


The suggestion that we have to make is not an original one. In fact, it is 
its non-originality that imparts to it most of its significance. For it points to the 
continuing failure of modern psychology to develop methods of investigation 
adequate to the task of understanding real life human behavior. The sugges- 
tion is simply this: Pending the extension of the “horizons and conceptions” of 
psychological science so as “to include these ostensibly pre-scientific areas of 
knowledge,” we vigorously recommend the witting, circumspect use of non- 
systematic, reflective knowledge as bona fide data in hypothesis and theory con- 
struction. Many psychologists, particularly those who are psychoanalytically 
oriented, already make use of this material, but for illustrative purposes only. 
What we are arguing for here is an inductive use of such knowledge, much as 
Allport has argued for an inductive use of personal documents (1). Perhaps the 
suggestion can be stated more simply. To wit: As long as psychologists continue 
to deny their birthright and leave to others—novelists, poets, playrights, social 
critics—the responsibility of analyzing, describing, or explaining the natural 
behavior-world, we had best make whatever effective use we can of their in- 
sights and observations. It would, | think, be stupidly presumptuous of us to ex- 
pect the novelist or playright to adopt the methods of science as a precondition 
for recognizing the value of their work. The poet, the novelist, the critic, each 
have their own methods of arriving at understanding. If psychologists believe 
that the methods of science are superior ways of understanding the natural 
behavior-world, let them turn to this world and demonstrate the superiority of 
their methods. One thinks here immediately of Piaget and of the very fruitful 
work that he has done. But Piaget is an exception, a notable one to be sure, but 
an exception nonetheless. 


It appears to us extremely unlikely that the insights and observations of the 
novelist or poet can be directly validated by experimental techniques. The com- 
plex events of the natural behavior-world are too spontaneous, have too much 
vitality to be faithfully recreated in the laboratory. By comparison, the pros- 
pects for clinical validation seem much brighter. Probably, the best procedure 
for validating naturalistic knowledge is to determine how well our understanding, 
prediction, and control of other observational events is improved through the 
use of this knowledge. Much of this material, many psychologists would agree, 
seems to have a high “subjective” validity. It is time we dipped into the store of 
riches freely offered by the poet and novelist and made effective use of it. 


In this paper we have argued for a pluralistic approach to the understanding 
of human behavior. Underlying this plea is a genuine fear that the psycholo- 
gist’s obsessive concern with scientific respectability may blind him to the ave- 
nues of understanding opened up by a non-experimental, observational type 
analysis. We agree with Krutch—although probably had we had the happy 
fortune to have first formulated this criticism, we would have spoken more cir- 
cumspectly—that: 
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Perhaps Hamlet was nearer right than Pavlov. Perhaps the 
exclamation “How like a god!” is actually more appropriate 
than “How like a dog! How like a rat! How like a machine!” 
Perhaps we have been deluded by the fact that the methods 
employed for the study of man have been for the most part 
those originally devised for the study of machines or the 
study of rats, and are capable, therefore, of detecting and 
measuring only those characteristics which the three do have 
in common. But we have already gone a long way on the 
contrary assumption, and we take it more completely for 
granted than we sometimes realize. The road back is not an 
easy one (5, pp. 32-33). 
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